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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(1) APPLICANT: RUEGER, DAVID C 

KUBERASAMPATH, THANGAVEL 
OPPERMANN, HERMANN 
OZKAYNAK ENGIN 
PANG, ROY H.L. 
COHEN, CHARLES H. 

W mmi10 '" ""MWWWWD NERVE REGENERATION AND 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS- 

(B) STREET**^"} Hilt' ' HURBITZ & THIBEAULT 
(D) biREET: 53 STATE STREET 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM- 

(A) MEDIUM TYPE: Floppy disk 
B COMPUTER: IBM Pc5.£Sble 

D S°0fS G p SYSTEM: PC - D OS/MS e -DOS 

S0FTWARE: Patent In Please #1.0, Version #1.25 
(vii) PRIOR APPLICATION DATA- 

(A) APPLICATION NUMBER: US 667 274 

(B) FILING DATE: ll-MAR-1991 

(vii) PRIOR APPLICATION DATA- 

IbJ nKr KBER! us ™ 

(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE : 

(A) NAME : Generic Sequence 1 
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(D) OTHER INFORMATION: Each Xaa 

indicates one of the 20 naturally- 
occurring L-isomer, a-amino acids 
or a derivative thereof, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Xaa Xaa Xaa Xaa Xaa 



1 5 



Xaa xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

io 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
20 25 

Cys xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
30 35 

Xaa xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cyl 

55 60 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 



85 90 



Xaa Cys Xaa 
95 



INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME : Generic Sequence 2 
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(D) OTHER INFORMATION: Each Xaa 

indicates one of the 20 naturally- 
occurring L-isomer, a-amino acids 
or a derivative thereof, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

10 15 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 

20 25 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 

30 35 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
40 45 50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

85 90 
Xaa Cys Xaa 
95 



INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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( ix ) FEATURE : 

(A) NAME: Generic Sequence 3 
(D) OTHER INFORMATION: wherein each 
Xaa is independently selected from 
a group of one or more specified 
amino acids as defined in the 
specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Leu Tyr Val Xaa Phe 
1 5 
Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

10 

Xaa Ala Pro Gly Xaa Xaa Xaa Ala 

15 20 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
25 30 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
40 45 
Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

70 75 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
85 90 
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Xaa Cys Gly Cys Xaa 
95 

INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 4 

(D) OTHER INFORMATION: wherein each 
Xaa is independently selected from 
a group of one or more specified 
amino acids as defined in the 
specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa " 

15 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 

20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 

30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 



40 



Asn xaa Xaa Asn His Ala Xaa Xaa 

45 50 
Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

55 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
90 95 

Xaa Cys Gly Cys Xaa 
100 

INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: hOP-1 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Ser Thr Gly Ser Lys Gin Arg Ser Gin 

1 5 

Asn Arg Ser Lys Thr Pro Lys Asn Gin 

10 15 

Glu Ala Leu Arg Met Ala Asn Val Ala 

20 25 

Glu Asn Ser Ser Ser Asp Gin Arg Gin 

30 35 

Ala Cys Lys Lys His Glu Leu Tyr Val 

40 45 
Ser Phe Arg Asp Leu Gly Trp Gin Asp 
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* 



Trp He He 
55 

Ala Tyr Tyr 
65 

Phe Pro Leu 
75 

Thr Asn His 

Val His Phe 

Pro Lys Pro 
100 

Leu Asn Ala 
110 

Asp Asp Ser 
120 

Lys Tyr Arg 

Cys Gly Cys 



Ala Pro Glu Gly Tyr Ala 
60 

Cys Glu Gly Glu Cys Ala 

70 

Asn Ser Tyr Met Asn Ala 

80 

Ala He Val Gin Thr Leu 
85 90 
He Asn Pro Glu Thr Val 
95 

Cys Cys Ala Pro Thr Gin 
105 

He Ser Val Leu Tyr Phe 
115 

Ser Asn Val lie Leu Lys 

125 

Asn Met Val Val Arg Ala 
130 135 
His 



INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: mOP-1 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Ser Thr Gly Gly Lys Gin Arg Ser Gin 
1 5 
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Asn Arg Ser Lys Thr Pro Lys Asn Gin 

10 15 

Glu Ala Leu Arg Met Ala Ser Val Ala 

20 25 

Glu Asn Ser Ser Ser Asp Gin Arg Gin 

30 35 

Ala Cys Lys Lys His Glu Leu Tyr Val 

40 45 

Ser Phe Arg Asp Leu Gly Trp Gin Asp 

50 

Trp He He Ala Pro Glu Gly Tyr Ala 

55 60 

Ala Tyr Tyr Cys Glu Gly Glu Cys Ala 
65 70 

Phe Pro Leu Asn Ser Tyr Met Asn Ala 

75 80 

Thr Asn His Ala He Val Gin Thr Leu 

85 9Q 

Val His Phe He Asn Pro Asp Thr Val 

95 

Pro Lys Pro Cys Cys Ala Pro Thr Gin 

100 105 

Leu Asn Ala He Ser Val Leu Tyr Phe 

no 115 

Asp Asp Ser Ser Asn Val He Leu Lys 

120 125 
Lys Tyr Arg Asn Met Val val Arg Ala 
130 

Cys Gly Cys His 



130 135 



INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 




(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: hOP-2 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Val Arg Pro Leu Arg Arg Arg Gin 

1 5 
Pro Lys Lys Ser Asn Glu Leu Pro Gin 

10 15 
Ala Asn Arg Leu Pro Gly He Phe Asp 

20 25 
Asp Val His Gly Ser His Gly Arg Gin 

30 35 
Val Cys Arg Arg His Glu Leu Tyr Val 

40 45 
Ser Phe Gin Asp Leu Gly Trp Leu Asp 

50 

Trp Val He Ala Pro Gin Gly Tyr Ser 

55 60 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ser 

65 70 
Phe Pro Leu Asp Ser Cys Met Asn Ala 

75 80 
Thr Asn His Ala He Leu Gin Ser Leu 

85 90 
Val His Leu Met Lys Pro Asn Ala Val 

95 

Pro Lys Ala Cys Cys Ala Pro Thr Lys 
100 105 

Leu Ser Ala Thr Ser Val Leu Tyr Tyr 

HO 115 

Asp Ser Ser Asn Asn Val He Leu Arg 

120 125 

Lys His Arg Asn Met Val Val Lys Ala 
130 135 

Cys Gly Cys His 
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INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: mOP-2 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Ala Arg Pro Leu Lys Arg Arg Gin 

1 5 
Pro Lys Lys Thr Asn Glu Leu Pro His 

10 15 
Pro Asn Lys Leu Pro Gly He Phe Asp 

20 25 
Asp Gly His Gly Ser Arg Gly Arg Glu 

30 35 
Val Cys Arg Arg His Glu Leu Tyr Val 

40 45 
Ser Phe Arg Asp Leu Gly Trp Leu Asp 

50 

Trp Val lie Ala Pro Gin Gly Tyr Ser 

55 60 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala 

65 70 
Phe Pro Leu Asp Ser Cys Met Asn Ala 

75 80 
Thr Asn His Ala lie Leu Gin Ser Leu 

85 90 
Val His Leu Met Lys Pro Asp Val Val 

95 

Pro Lys Ala Cys Cys Ala Pro Thr Lys 
100 105 



• # 

- 108 - 



Leu 


Ser 
110 


Ala 


Thr 


Ser 


Val 


Asp 


Ser 


Ser 
120 


Asn 


Asn 


Val 


Lys 


His 


Arg 


Asn 
130 


Met 


Val 


Cys 


Gly 


Cys 


His 







115 

He Leu Arg 
125 

Val Lys Ala 
135 



INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE : amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: CBMP-2A(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CYS Lys Arg His Pro Leu Tyr s ^ 

1 5 
ASP Val Gly Trp Asn ftsp Irp ne m » 

Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu 

cys Pro Phe Pro ^ Ala Asp Hi 3 s ° am 

Thr Asn His Wa Ile val G1 4 „° Thr 

50 S5 
Ser val Asn Ser Lys lie Pro Lys Ala Cys Cys 



60 65 
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Val Pro Thr Glu Leu Ser Ala He Ser Met Leu 

70 75 
Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys 

80 85 
Asn Tyr Gin Asp Met Val Val Glu Gly Cys Gly 

90 95 
Cys Arg 
100 



INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: CBMP-2B(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Arg Arg His Ser 
1 5 
Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 

10 15 
Asp Trp He Val Ala Pro Pro Gly Tyr Gin Ala 

20 25 
Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu 

30 35 
Ala Asp His Leu Asn Ser Thr Asn His Ala He 

40 45 
Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser 
50 55 60 

He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu 

65 70 
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Ser Ala lie Ser Met Leu Tyr Leu Asp Glu Tyr 

75 80 
Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 

85 90 
Val Val Glu Gly Cys Gly Cys Arg 
95 100 



INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: DPP(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser 

15 10 
Asp Val Gly Trp Asp Asp Trp lie Val Ala Pro 

15 20 
Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly Lys 

25 30 
Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser 

35 40 
Thr Asn His Ala Val Val Gin Thr Leu Val Asn 
45 50 55 

Asn Asn Asn Pro Gly Lys Val Pro Lys Ala Cys 

60 65 
Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met 
70 75 
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Leu Tyr Leu Asn Asp Gin Ser Thr Val Val Leu 

80 85 
Lys Asn Tyr Gin Glu Met Thr Val Val Gly Cys 

90 95 
Gly Cys Arg 
100 



INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: Vgl(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys 

1 5 10 

Asp Val Gly Trp Gin Asn Trp Val He Ala Pro 

15 20 
Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly Glu 

25 30 
Cys Pro Tyr Pro Leu Thr Glu He Leu Asn Gly 

35 40 
Ser Asn His Ala He Leu Gin Thr Leu Val His 
45 50 55 

Ser He Glu Pro Glu Asp He Pro Leu Pro Cys 

60 65 
Cys Val Pro Thr Lys Met Ser Pro He Ser Met 
70 75 



• tf 
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Leu Phe Tyr Asp Asn Asn Asp Asn Val Val Leu 

80 85 
Arg His Tyr Glu Asn Met Ala Val Asp Glu Cys 

90 95 
Gly Cys Arg 
100 



INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
( ix ) FEATURE : 

(A) NAME: Vgr-l(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Gin 

1 5 io 

Asp Val Gly Trp Gin Asp Trp He He Ala Pro 

15 20 
Xaa Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu 

25 30 
Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala 

35 40 
Thr Asn His Ala He Val Gin Thr Leu Val His 
45 50 55 

Val Met Asn Pro Glu Tyr Val Pro Lys Pro Cys 

60 65 
Cys Ala Pro Thr Lys Val Asn Ala He Ser Val 
70 75 



- 113 - 



Leu Tyr Phe Asp Asp Asn Ser Asn Val He Leu 

80 85 
Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys 

90 95 
Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: human 
(F) TISSUE TYPE: BRAIN 

(ix) FEATURE: 
(D) OTHER INFORMATION: 

/product= "GDF-1 (fx)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly" 
1 5 10 

TrD His Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Tyr 
15 20 25 

Cys Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 
30 35 40 

Glv Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His 
45 50 55 

Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 
60 65 70 

Arg Leu Ser Pro He Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 
75 80 85 

Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 
90 95 100 

Cys Arg 
105 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Xaa Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 1341 

(D) OTHER INFORMATION: /standard_name= "hOPl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 57 

Met His Val 
1 

CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA 105 
Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala 
5 10 15 

CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC 153 
Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn 
20 25 30 35 

GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG 201 
Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arg 
40 45 50 



r 
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CGG GAG ATG CAG CGC GAG ATr rrr vrr .n-n ^ 

«. «. irg cau ' £ « £ « ™ ooc ™ ca ac ooc 

S Pro £ 2 Gin G % C £ £ J* « <*A CCC ATG TTC ATG 
70 bly Hls Asn Ser Ala Pro Met Phe Met 

° 80 

2 5 2 55 2 2 2 2 & 2 2 2 2 2 2 2 

£ a S22SSSS2S2£ ff g;« 
S5=2£=22222 =S5S 5 

130 

2 2 2 £ ™ Z £ CTC CTC GAA CAT GAC AAG CM TTC TTC 

135 L6U JfjJ Glu H " Asp Lys Glu Phe Phe 

145 

- 2 2 2 2 2: 2 2 2 2 s r r ** « « 

150 B rne Ar H p he Asp Leu Ser Lys He 

33 160 

- S 2 o c ^ 2 2 2 2 2 2 2 2 g £ - « 
£ - 2 2 2 2 2 2 2 2 2 2 2 2 2 2 

2 2 2 2 2 2 2 2 2 2 £ r f ™ ™ <* 

200 7 rg Ser As P Leu Phe Leu Leu 

3 210 

- S S 2 2 2 2 2 2 2 2 2 2 2 2 2 

110 225 

- - 2 2 2 2 2 2 2 2 2 2 2 2 2 2 

J3 240 

S 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 



249 



297 



345 



393 



441 



489 



537 



585 



633 



681 



729 



777 



825 



r 



• 
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AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC 873 
Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro 
260 265 270 275 

TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC 921 
Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser He 
280 285 290 

CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC 969 
Arte Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro 
B 295 300 305 

AAG AAC CAG GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC 1017 
Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser 
310 315 320 

AGC GAC CAG AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC 1065 
Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe 
325 330 335 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 1113 
Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala 
340 345 350 355 

GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 1161 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met 
360 365 370 

AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 1209 
Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn 
375 380 385 

CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 1257 
Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 
390 395 400 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 1305 
He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys 
405 410 415 

TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 1351 
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTCG CCTTGGCCAG 1411 

GAACCAGCAG ACCAACTGCC TTTTGTGAGA CCTTCCCCTC CCTATCCCCA ACTTTAAAGG 1471 

TGTGAGAGTA TTAGGAAACA TGAGCAGCAT ATGGCTTTTG ATCAGTTTTT CAGTGGCAGC 1531 

ATCCAATGAA CAAGATCCTA CAAGCTGTGC AGGCAAAACC TAGCAGGAAA AAAAAACAAC 1591 



- 117 - 



GCATAAAGAA AAATGGCCGG GCCAGGTCAT TGGCTGGGAA GTCTCAGCCA TGCACGGACT 1651 

CGTTTCCAGA GGTAATTATG AGCGCCTACC AGCCAGGCCA CCCAGCCGTG GGAGGAAGGG 1711 

GGCGTGGCAA GGGGTGGGCA CATTGGTGTC TGTGCGAAAG GAAAATTGAC CCGGAAGTTC 1771 

CTGTAATAAA TGTCACAATA AAACGAATGA ATGAAAAAAA AAAAAAAAAA A 1822 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(D) OTHER INFORMATION: /Product="OPl-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
15 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu 
50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 
85 90 95 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 110 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
130 135 140 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 160 
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Ser Lys He Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He 
165 170 175 

Tyr Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg He 
180 185 190 

Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 

Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 
210 215 220 

Val Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg 
225 230 235 240 

His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 
245 250 255 

He Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn 

260 265 270 

Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 
275 280 285 

Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 
290 295 300 

Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 
305 310 315 320 

Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 
325 330 335 

Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu 
340 345 350 

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 

Ser Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His 
370 375 380 

Phe He Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 
385 390 395 400 

Leu Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He 
405 410 415 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 



« * 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 1393 tmutx „ 
(D) OTHER INFORMATION: /note= "MOP1 (CDNA)" 



□ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTGCAGCAAG TGACCTCGGG TCGTGGACCG CTGCCCTGCC CCCTCCGCTG CCACCTGGGG 
CGGCGCGGGC CCGGTGCCCC GGATCGCGCG TAGAGCCGGC GCG ATG CAC GTG CGC 



1 



TCG CTG CGC GCT GCG GCG CCA CAC AGC TTC GTG GCG CTC TGG GCG CCT 
Ser Leu Arg Ala Ala Ala Pro His SerPhe Val Ala Leu Trp Ala Pro 
5 10 15 ZV 

CTG TTC TTG CTG CGC TCC GCC CTG GCC GAT TTC AGC CTG GAC AAC GAG 
Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu 

30 35 



25 



GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG AGC GGG CCG GAC GGA CAG 
Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly Pro Asp Gly Gin 
85 90 95 iuw 



60 
115 

163 

211 



GTG CAC TCC AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG 259 
Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser Gin Glu Arg Arg 

45 dU 



GAG ATG CAG CGG GAG ATC CTG TCC ATC TTA GGG TTG CCC CAT CGC CCG 307 
Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu Pro His Arg Pro 
55 60 65 

CGC CCG CAC CTC CAG GGA AAG CAT AAT TCG GCG CCC ATG TTC ATG TTG 355 
Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Het Leu 
70 75 80 



403 



• 6 
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GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT 451 
Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly Pro Pro 
105 110 115 

TTA GCC AGC CTG CAG GAC AGC CAT TTC CTC ACT GAC GCC GAC ATG GTC 499 
Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp Met Val 
120 125 130 

ATG AGC TTC GTC AAC CTA GTG GAA CAT GAC AAA GAA TTC TTC CAC CCT 547 

Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro 

135 140 145 

CGA TAC CAC CAT CGG GAG TTC CGG TTT GAT CTT TCC AAG ATC CCC GAG 595 

Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He Pro Glu 
150 155 160 

GGC GAA CGG GTG ACC GCA GCC GAA TTC AGG ATC TAT AAG GAC TAC ATC 643 
Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Tyr He 
165 170 175 180 

CGG GAG CGA TTT GAC AAC GAG ACC TTC CAG ATC ACA GTC TAT CAG GTG 691 
Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr Val Tyr Gin Val 
185 190 195 

CTC CAG GAG CAC TCA GGC AGG GAG TCG GAC CTC TTC TTG CTG GAC AGC 739 
Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser 
200 205 210 

CGC ACC ATC TGG GCT TCT GAG GAG GGC TGG TTG GTG TTT GAT ATC ACA 787 
Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp He Thr 
215 220 225 

GCC ACC AGC AAC CAC TGG GTG GTC AAC CCT CGG CAC AAC CTG GGC TTA 835 
Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu 
230 235 240 

CAG CTC TCT GTG GAG ACC CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG 883 
Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro Lys Leu 
245 250 255 260 

GCA GGC CTG ATT GGA CGG CAT GGA CCC CAG AAC AAG CAA CCC TTC ATG 931 
Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro Phe Met 
265 270 275 

GTG GCC TTC TTC AAG GCC ACG GAA GTC CAT CTC CGT AGT ATC CGG TCC 979 
Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg Ser He Arg Ser 
280 285 290 

ACG GGG GGC AAG CAG CGC AGC CAG AAT CGC TCC AAG ACG CCA AAG AAC 1027 
Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys Asn 
295 300 305 



l_ff 
01 
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2 2 2 2 2 2 2 2 g S - g £ - « .e 
g«22g222 2 ~« £S2;2 . 

340 

£ * 2 Gin g £ « *f « « TAT OCT CCC TAC 

345 6 Ala Z T ° Glu G1 y Tyr Ala Ala Tyr 

355 

2 2 2 £ Giu 2 2 Z £ 2 r < ICC ™ « « 

360 ^ ™ e Leu A sn Ser Tyr Met Asn Ala 

Jt>:) 370 

« - 1 2 2 S 2 g 2 S Sis 2 g - s jjc 
« g 2 2 2 g g 2 g « g. g ~ - « £ 
g 2 2 2 2 § 2 E 2 2 2 2 2 2 2 2 

^* i5 420 

2 2 % % i* s 2 2 2 2 TAGCTmcc ~™ 



425 430 



ACC^CGG GGC CA G ACCT TTCCAAATCT TC G AT GTC TC ACCATCTAA G TCTCTCAGTC 
CCC.CC TOG C G A GGAG AA C A.ACCAACCT CTCCIGACCC ™ 
AAGCATGTAA GGG1TCCAGA AACC.GACCG T GC A G CA GC T G A IG4G C GG C 

™ I" m — — 

CTCTGCCACG AAA G T G TCCA GTCTCCACAT GGCC CCTG G C G CTCTCAGTC TTTGACGAGT 
M C GCAACC mc mu 

= AGCCAAACCA ACCA G AA G CC ACTCTAATGA TATGTCACAA TAAAACCCAT 

GAATGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAGAATTC 



1075 

1123 

1171 

1219 

1267 

1315 

1363 

1413 

1473 

1533 

1593 

1653 

1713 

1773 

1833 

1873 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

(ix) FEATURE: 

(D) OTHER INFORMATION: /product= "mOPl-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
15 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu 
50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly 
85 90 95 

Pro Asp Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr 
100 105 HO 

Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp 
115 120 125 

Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu 
130 135 140 

Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser 
145 150 155 160 

Lys He Pro Glu Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr 
165 170 175 

Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr 
180 185 190 

Val Tyr Gin Val Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe 
195 200 205 
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6 



U " % AS " S " 4 * "» Hi *> «■ «« «- Olu G ly Trp u. w 

"5 IU ^ 25 S " *" " 1S « - Pro Arg His 

Asn Leu Gly Leu Gin Leu Ser Val Glu r . 

245 1 G1U ^ Leu As P Gly Gin Ser lie 

0 Lys 5S 413 Gly Leu »• gj « is Gly Pro Cln Asn Lys 

270 

" 2« VU "J "a » r a. « Hls Uu Arg 

«« ne Ser Ihr Gly Lys Gln Ars ser ^ Arg ser Lys 

& lys is „ Gln G; „ Ala Uu Arg Met vai Aia ^ 

320 

Ser Ser Ser Asp Gln Arg Gln Ala rvc r , 

325 g Ala Cys Lys Lys His Glu Leu Tyr Val 

U 335 
ft. itg ^ Uu cly Jrp ^ ^ ^ ^ ^ ^ ^ 

350 

^ *U £ ^ ^ ^ G1 „ ^ ua im ^ ^ 

365 

^ «e; as, Ma ttt As „ „ 41a Ue val ^ ^ p 

JO 380 

ne as„ PI0 ASP ar J;J Pro Lys Pro cys iu l ^ 

J95 400 

U IU Sei X, 1 Leu ^ r Ph * «•» Jfg Ser s " - ™ „. tsu 

^ ». Asn Bet Va! V al j- Ala cys Cly Cys 

b 430 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 1723 base p airs 
to; TYPE: nucleic acid 
(C STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 490.. 1696 

(D) OTHER INFORMATION: /note= "hOP2 (cDNA) " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGCGCCGGCA GAGCAGGAGT GGCTGGAGGA GCTGTGGTTG GAGCAGGAGG TGGCACGGCA 60 
GGGCTGGAGG GCTCCCTATG AGTGGCGGAG ACGGCCCAGG AGGCGCTGGA GCAACAGCTC 120 
CCACACCGCA CCAAGCGGTG GCTGCAGGAG CTCGCCCATC GCCCCTGCGC TGCTCGGACC 180 
GCGGCCACAG CCGGACTGGC GGGTACGGCG GCGACAGAGG CATTGGCCGA GAGTCCCAGT 240 

5 CCGCAGAGTA GCCCCGGCCT CGAGGCGGTG GCGTCCCGGT CCTCTCCGTC CAGGAGCCAG 300 

yd 

GACAGGTGTC GCGCGGCGGG GCTCCAGGGA CCGCGCCTGA GGCCGGCTGC CCGCCCGTCC 360 

H CGCCCCGCCC CGCCGCCCGC CGCCCGCCGA GCCCAGCCTC CTTGCCGTCG GGGCGTCCCC 420 

Lfj AGGCCCTGGG TCGGCCGCGG AGCCGATGCG CGCCCGCTGA GCGCCCCAGC TGAGCGCCCC 480 

= " CGGCCTGCC ATG ACC GCG CTC CCC GGC CCG CTC TGG CTC CTG GGC CTG 528 

Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu 
S 1 5 10 

HI GCG CTA TGC GCG CTG GGC GGG GGC GGC CCC GGC CTG CGA CCC CCG CCC 576 

Hi Ala Leu Cys Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro 

JO 15 20 25 

GGC TGT CCC CAG CGA CGT CTG GGC GCG CGC GAG CGC CGG GAC GTG CAG 624 
Gly Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin 
30 35 40 45 

CGC GAG ATC CTG GCG GTG CTC GGG CTG CCT GGG CGG CCC CGG CCC CGC 672 
Arg Glu He Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg 
50 55 60 

GCG CCA CCC GCC GCC TCC CGG CTG CCC GCG TCC GCG CCG CTC TTC ATG 720 
Ala Pro Pro Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met 
65 70 75 

CTG GAC CTG TAC CAC GCC ATG GCC GGC GAC GAC GAC GAG GAC GGC GCG 768 
Leu Asp Leu Tyr His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala 
80 85 90 
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is: 



s s 2 2 s 2 2 s 2 l tg sf « « ™ « 

95 rjE 8 la Asp Leu Val Met Ser Phe Val 

1UU 1Q5 

£ £ 2 2 £ g 2 S 2 s as ss 2 s s s 

120 125 

£ 2 £ 2 2 2 SS 2 Gin s 2 S 2 2 s s 

135 140 

£ S S g 2 5 s £ g « ccc s « go « « 

£ £ 2 £ 2 «J £ j* « CAC «J «C CAG GAG CAC XCC 

160 ?JJ Phe Gln Val Val Gin Glu Gin Ser 

Lb > 170 

£ g SS E 25 SS 2 2 2 J« 2 g - « « „ 
g 2 Sfi Giy g 2 s 2 Asp g 2 s 2 2 £ £ 

200 205 

5 2 2 £ g 2 £ 2 2 Giy 2 2 2 2 2 2 

215 220 

£ Sfi 2 1 2 £ 2 2 2 Siy 2 2 2 2 2 2 
Sfi 5 £ 2 2 2 2 2 2 2 2 2 2 2 S 2 

^<o 250 

£ £ £ £ 2 £ 2 £ 2 2 £ 2 2 2 2 2 

265 

g £ £ 2 £ g 2 £ 2 2 2 2 2 £ 2 2 
£ £ £ £ g 2 2 2 g g 2 £ 2 2 2 £ 

295 300 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 




CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTC GGC TGG CTG GAC 
Arg Arg His clu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp 
305 310 J 

TGG GTC ATC GCT CCC CAA GGC TAC TCG GCC TAT TAC TGT GAG GGG GAG 
E£ 55 He S Fro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu 
320 325 

TGC TCC TTC CCA CTG GAC TCC TGC ATG AAT GCC ACC AAC CAC GCC ATC 
Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie 
335 3 *0 345 

CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG 
Leu Gin Ser Leu Val His Leu Het Lys Pro Asn Ala Val Pro Lys Ala 

350 355 360 ^ 

TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC TCT GTG CTC TAC TAT GAC 
Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp 
370 375 JBU 

AGC AGC AAC AAC GTC ATC CTG CGC AAA CAC CGC AAC ATG GTG GTC AAG 
Ser Ser Asn Asn Val He Leu Arg Lys His Arg Asn Met Val Val Lys 
385 390 395 

GCC TGC GGC TGC CAC T GAGTCAGCCC GCCCAGCCCT ACTGCAG 
Ala Cys Gly Cys His 
400 

(2) INFORMATION FOR SEQ ID NO: 21: 

( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(lX) (A)OTHER INFORMATION: /product= "hOP2-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
1 5 10 13 

Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro Gly Cys Pro 
20 25 

Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin Arg Glu lie 

35 40 " 



« 
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i 



60 

Ala Ala Ser Arg Leu Pro Ala Ser 



Pro 



Ala Pro Leu Phe Met Leu Asp Leu 



* - - «. Gly Asp <sp Asp ^ ^ p 4u ^ ^ 

4rg Arg uu s ** - ^ - « - - Phe „ val 

Glu Arg Asp Arg Ala Leu Gly His ri„ n » 

lis " nis "In Glu Pro Hi c i 

115 no s Tr P L ys Glu Phe 

Arg - asp uu Thr - & - - fl ;:; Ihr Ala Aia 

140 

Glu Phe Arg Il e ^ L y s Val pr<j ^ 



150 Ile J*| Leu L eu Asn Arg Thr 

Leu His Val Ser Met Phe Gin 160 

~~ "*~ ™* UJ - U ^J-n ser Asn 

175 

Phe Leu Asn T.ai. oi_ mi 

180 



IS "* G1 " Vl1 ^ «" «n ser 4S „ 4rg Glu 

175 

s " 4SP Leu " e Phe leu - - s - - - Gly Asp Glu 

190 



Gly Trp Leu Val Leu Asp Val Thr Ah ii c 

195 200 3 Ser ASP <*» Tr P Leu 

205 



Leu 



- $ «• * Leu s leu Atg leu ^ r ^ g1q iht ^ 

s - - « - P a Gly leu Ala Gly uu ciy wn Ars ^ 

240 



" 8 S " S? »» ™ ™ - «. Phe irg Ala 

ser Pro Ile % ihr f - , g Ars Pt0 u „ Arg Arg 

p r 270 

ys & s " As " Glu - SX G1 " - - - Pro Gly a 



Ser Pro 

255 



Gin 



- a « - cly s „ is 01y irg Cln ^ ^ Arg Ars his 
s - * « ser jj. Gln Asp Leu Gly Leu Asp itp vai ne 

1!> 320 
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Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser Phe 
325 330 335 

Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie Leu Gin Ser 
340 345 350 

Leu Val His Leu Het Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala 
355 360 365 

Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn 
370 375 380 

Asn Val lie Leu Arg Lys His Arg Asn Het Val Val Lys Ala Cys Gly 
385 390 . 395 400 

Cys His 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 93.. 1289 

(D) OTHER INFORMATION: /note= "mOP2 cDNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

GCCAGGCACA GGTGCGCCGT CTGGTCCTCC CCGTCTGGCG TCAGCCGAGC 

CCGACCAGCT ACCAGTGGAT GCGCGCCGGC TGAAAGTCCG AG ATG GCT ATG CGT 

Met Ala Met Arg 
1 

CCC GGG CCA CTC TGG CTA TTG GGC CTT GCT CTG TGC GCG CTG GGA GGC 
Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys Ala Leu Gly Gly 
5 10 15 20 

GGC CAC GGT CCG CGT CCC CCG CAC ACC TGT CCC CAG CGT CGC CTG GGA 
Gly His Gly Pro Arg Pro Pro His Thr Cys Pro Gin Arg Arg Leu Gly 
25 30 35 



r 

# 
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P 



2 S 2 5 5 5 2 2 5 2 S « « « « . 

45 50 

2 S 1 2 s s s 5 s 2 s S 2 S 2 SS 

s s e s g s g 2 -c 2 « « „ 

80 

5 E E E ; S 2 SS 2 His 2 5 2 2 - 

S E Met £ 2 E 2 2 52 2 S C r « -C C K « 

105 31 ?tn g ASP Ar S Thr Le " Gly 

115 

TAC CAG GAG CCA CAC TGG AAG GAA TTC CAC TTT pap ™, 
Tyr Gin Glu Pro His Trp Lys Glu Phe Ms 21 J t CTA ACC CAG ATC 
120 ' J™ Hls Phe Asp Leu Thr Gin lie 

130 

- 2 1 SS 2 2 2 2 2 2 2 g 2 SS £ 2 
S £ £ SS £ 2 2 2 £ 2 His ne 2 2 E SS 

160 

E - 2 2 SS £ 2 2 SS E 2 E S 2 E £ 
2 2 2 E § E 2 £ SS g 2 S S E 2 2 

195 

2 2 2 2 E 2 S S S 2 2 2 2 E S E 

^ u:> 210 

E 5 E 2 E 2 2 2 2 E E 2 2 E E E 
2 2 % S E E g 2 2 S 5 E 2 2 S E 

3 240 



248 



296 



344 



392 



440 



488 



536 



584 



632 



680 



728 



776 



824 
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GCA GCG AGA CCA CTr hp 260 

290 

" ss g ™ = '» I S ffi S % ffi g « E - 

370 

Es s*55ssaaBsaaaB 

395 

TGCTTCTACT ACCITACCAT CTGGCCGGGC CCCTCTCCAP A rrn 
TATCATAGCT Pap^a C CCTCTCCAG AGGCAGAAAC CCTTCTATGT 

TCAT. GCT CAGACAGGGG CAATGGCAGG CCCTTCACTT CCCCT GGCCA CTTCCTGCTA 
AAATTCTGGT CTTTCCCAGT TCCTCTPTPr ^TCCTGCTA 

««« crZz " gmtccac4c agcatcccag ~~ 

crCAccccAc IT CCACATCAGG " G ~ ™atc 

CTCACCCCAC AAXCCCAAA, TCTGGATGGT CTAAGAAGGC „C TAAACTAGAT 



872 
920 
968 

1016 

1064 
1112 



1160 
1208 
1256 
1309 

1369 

1429 

1489 
1549 
1609 
1669 



r 
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GATCTGGGCT CTCTGCACCA TTCATTGTGG CAGTTGGGAr at™ 

CATACACTTA p atp CAGTTGGGAC ATTTTTAGGT ATAACAGACA 1729 

CATACACTTA GATCAATGCA TCGCTGTACT CCTTGAAATC AGAGCTAGCT TGTTAGAAAA „ 
AGAATCAGAG CCAGGTATAG CGGTGCATGT CATTAATCCC AGCGCTAAAG A 

AGGAGAATCT CTGTGAGTTr nrr, AGCGCTAAAG AGACAGAGAC 1849 

CTGTGAGTTC AAGGCCACAT AGAAAGAGCC TGTCTCGGGA GCAGCAAAA a 
AAAAAAAAAC GGAATTC GCAGGAAAAA 1909 

1926 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) ?A E ? U ^rf CHARA CTERISTICS: 

„ JFP E: mino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE : 

("> OTHER INFORMATION: /p roduct , 

(xi) SEQUENCE DESCRIPTION: SE, ID N0 . 23 . 
«e f - te Atg %0 Cly pto Uu ^ ^ ^ l ^ ^ ^ ^ 

- u. = ly ci rly His Gly pro pM ^ c ^ * 

30 

" ^ UU I? *« - - -a d 5 Pro 41a ua 

Ala Arg Gin Pro Ala Ser Ala Prn i D u 

7Q Ala Pro Leu Ph e Met Leu Asp Leu ^ ^ 

80 

1? *> ^ ^ P„ G i„ 41a His Leu Gly ^ 

AU a Uu val " et ser a - - - - ffi ^ *. Ihr 
s 01y * r Cln Clu s - «- - «. , e Asp uu fc 

- - *. Sla Gly „ u 41a val nr AU wu ^ ^ iu ™ 
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Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His lie Ser Met 

150 155 
Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu Phe Phe 

165 170 
Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu 
180 185 

Asp He Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His Lys A.p 

200 zu:) 



195 



Leu Gly Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser Met Asp 
215 220 

Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin 

230 

Pro Phe Met Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala 

245 250 
Pro Arg Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn 

260 265 
Glu Leu Pro His Pro Asn Lys Leu Pro Gly lie Phe Asp Asp Gly His 
275 



280 285 



Gly Ser Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser 

295 

Phe Arg Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr 

310 315 
Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys 

325 330 
Met Asn Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His Leu Met 



340 



345 3 50 



Lys Pro Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser 
355 360 365 

Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg 
375 380 

Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
390 395 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1368 base pairs 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1368 

(D) OTHER INFORMATION: /STANDARD NAHE="60A n 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: WHARTON, KRISTI A.; THOMSEN, GERALD H. 
GELBERT, WILLIAM M. 

(B) TITLE: DROSOPHILA 60A GENE... 

(C) JOURNAL: PROC. NAT'L ACAD. SCI. USA 

(D) VOLUME: 88 

(E) RELEVANT RESIDUES IN SEQ ID NO:3: FROM 1 TO 1368 

(F) PAGES: 9214-9218 

(G) DATE: OCT - 1991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATG TCG GGA CTG CGA AAC ACC TCG GAG GCC GTT GCA GTG CTC GCC TCC 
Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
1 5 io 15 

CTG GGA CTC GGA ATG GTT CTG CTC ATG TTC GTG GCG ACC ACG CCG CCG 
Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

GCC GTT GAG GCC ACC CAG TCG GGG ATT TAC ATA GAC AAC GGC AAG GAC 
Ala Val Glu Ala Thr Gin Ser Gly lie Tyr lie Asp Asn Gly Lys Asp 

" 40 45 

rtn ^ G ^ C f G CAC AGA GTG CTG AGC GAG GAC GAC AAG CTG GAC GTC 
Gin Thr lie Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 

3U 55 60 

TCG TAC GAG ATC CTC GAG TTC CTG GGC ATC GCC GAA CGG CCG ACG CAC 
Ser Tyr Glu He Leu Glu Phe Leu Gly lie Ala Glu Arg Pro Thr His 
65 70 75 so 

CTG AGC AGC CAC CAG TTG TCG CTG AGG AAG TCG GCT CCC AAG TTC CTG 
Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 

CTG GAC GTC TAC CAC CGC ATC ACG GCG GAG GAG GGT CTC AGC GAT CAG 
Leu Asp Val Tyr His Arg lie Thr Ala Glu Glu Gly Leu Ser Asp Gin 
!00 105 no 



r 
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GAT GAG GAC GAC GAC TAC GAA CGC GGC CAT CGG TCC AGG AGG AGC GCC 
Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 



384 



GAC CTC GAG GAG GAT GAG GGC GAG CAG CAG AAG AAC TTC ATC ACC GAC 
Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe lie Thr Asp 
130 135 140 



432 



CTG GAC AAG CGG GCC ATC GAC GAG AGC GAC ATC ATC ATG ACC TTC CTG 
Leu Asp Lys Arg Ala He Asp Glu Ser Asp He He Met Thr Phe Leu 
145 150 155 160 



480 



AAC AAG CGC CAC CAC AAT GTG GAC GAA CTG CGT CAC GAG CAC GGC CGT 
Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 



528 



CGC CTG TGG TTC GAC GTC TCC AAC GTG CCC AAC GAC AAC TAC CTG GTG 
Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 



576 



ATG GCC GAG CTG CGC ATC TAT CAG AAC GCC AAC GAG GGC AAG TGG CTG 
Met Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 



624 



ACC GCC AAC AGG GAG TTC ACC ATC ACG GTA TAC GCC ATT GGC ACC GGC 
Thr Ala Asn Arg Glu Phe Thr He Thr Val Tyr Ala He Gly Thr Gly 
210 215 220 



672 



ACG CTG GGC CAG CAC ACC ATG GAG CCG CTG TCC TCG GTG AAC ACC ACC 
Thr Leu Gly Gin His Thr Met Glu Pro Leu Ser Ser VaTAsn Thr Thr 
225 230 235 240 



720 



GGG GAC TAC GTG GGC TGG TTG GAG CTC AAC GTG ACC GAG GGC CTG CAC 
Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 



768 



GAG TGG CTG GTC AAG TCG AAG GAC AAT CAT GGC ATC TAC ATT GGA GCA 
Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly He Tyr He Gly Ala 
260 265 270 



816 



CAC GCT GTC AAC CGA CCC GAC CGC GAG GTG AAG CTG GAC GAC ATT GGA 
His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp He Gly 
275 280 285 



864 



CTG ATC CAC CGC AAG GTG GAC GAC GAG TTC CAG CCC TTC ATG ATC GGC 
Leu He His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met He Gly 
290 295 300 



912 



TTC TTC CGC GGA CCG GAG CTG ATC AAG GCG ACG GCC CAC AGC AGC CAC 
Phe Phe Arg Gly Pro Glu Leu He Lys Ala Thr Ala His Ser Ser His 
305 310 315 320 



960 
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CAC AGG AGC AAG CGA AGC GCC AGC CAT CCA CGC AAG CGC AAG AAG TCG 1008 
His Arg Ser Lys Arg Ser Ala Set His Pro Arg Lys Arg Lys Lys Ser 
325 330 335 

GTG TCG CCC AAC AAC GTG CCG CTG CTG GAA CCG ATG GAG AGC ACG CGC 1056 
Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
340 345 350 

AGC TGC CAG ATG CAG ACC CTG TAC ATA GAC TTC AAG GAT CTG GGC TGG 1104 
Ser Cys Gin Met Gin Thr Leu Tyr He Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

CAT GAC TGG ATC ATC GCA CCA GAG GGC TAT GGC GCC TTC TAC TGC AGC 1152 
His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

GGC GAG TGC AAT TTC CCG CTC AAT GCG CAC ATG AAC GCC ACG AAC CAT 1200 
Gly Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His 
„ 385 390 395 400 

? GCG ATC GTC CAG ACC CTG GTC CAC CTG CTG GAG CCC AAG AAG GTG CCC 1248 

^0 Ala He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 

W 405 410 415 

M AAG CCC TGC TGC GCT CCG ACC AGG CTG GGA GCA CTA CCC GTT CTG TAC 1296 

LFS Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 

m 420 425 430 



h CAC CTG AAC GAC GAG AAT GTG AAC CTG AAA AAG TAT AGA AAC ATG ATT 1344 

'5 His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met He 

~; 435 440 445 

*j GTG AAA TCC TGC GGG TGC CAT TGA 1368 

* Val Lys Ser Cys Gly Cys His 

N 450 455 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
15 10 15 
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Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

Ala Val Glu Ala Thr Gin Ser Gly He Tyr He Asp Asn Gly Lys Asp 
35 40 45 

Gin Thr He Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

Ser Tyr Glu He Leu Glu Phe Leu Gly He Ala Glu Arg Pro Thr His 
65 70 75 80 

Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 

85 90 95 

Leu Asp Val Tyr His Arg He Thr Ala Glu Glu Gly Leu Ser Asp Gin 
100 105 no 

Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
H5 120 125 

Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe lie Thr Asp 
130 135 HO 

Leu Asp Lys Arg Ala He Asp Glu Ser Asp He He Met Thr Phe Leu 
145 150 155 160 

Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 

Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

Met Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

Thr Ala Asn Arg Glu Phe Thr He Thr Val Tyr Ala He Gly Thr Glv 
210 215 220 

Thr Leu Gly Gin His Thr Met Glu Pro Leu Ser Ser Val Asn Thr Thr 
225 230 235 240 

Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
2*5 250 255 

Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly He Tyr He Gly Ala 
260 265 270 

His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp He Glv 
275 280 285 

Leu He His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met He Glv 
290 295 300 



• 6 
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PJe Phe 4rg Gly Pro ^ Uu Ue Lys 4u s ^ 

5 320 

Arg Ser Lys ig ser AU s " «■ m *» - Arg lys g, ser 

v- Ser Pr „ jj. As „ val Pro Uu clu Fro Bet oiu ser 2 

J " 350 • * 

Ser Cys Gin Met Gin Thr Leu Tyr He Asn Ph* i » 

355 3JJ 1Ae As P Phe L ys Asp Leu Gly Trp 

365 

Hi. £p Trp He ne «. £ Glu Gly ^ ^ ^ ^ 

J/:> 380 

gj «« cys 4S „ Phe p „ leu Asn ila Hls 

«a He V al Gln jg Leu M „ ls Uu uu ^ l ^ v ^ pw 

415 

Lys Pro Cys Cys Ala Pro Thr Arg Leu flv at , „ 

420 g Jf " Gly Ala Leu Pro Val Leu Tyr 

^ 430 

-s Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Ar g Asn Met He 



Val Lys Ser Cys Gly Cys His 
4^0 455 



440 445 



(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) ORIGINAL SOURCE : 

(A) ORGANISM: Homo Sapiens 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note=»BMP3» 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 
(i)SEQUENCE CHARACTERISTICS- 

(B) Sp TH: 104 'acids 
10; IYPE: ammo acid 



6 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..104 

(D) OTHER INFORMATION: /note="BHP3" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp Ser 
15 10 15 

Glu Trp He He Ser Pro Lys Ser Phe Asp Ala Tyr Try Cys Ser Gly 
20 25 30 

Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 
35 40 45 

Thr He Gin Ser He Val Ala Arg Ala Val Gly Val Val Pro Gly He 
50 55 60 

Pro Glu Pro Cys Cys Val Pro Glu Lys Met Ser Ser Leu Ser He Leu 
65 70 75 80 

Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met 
85 90 95 

Thr Val Glu Ser Cys Ala Cys Arg 
100 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "BMP5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 



r 
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Cys Lys Lys His Glu Leu Tyr Val W Ph D * , 

1 5 Uiyr VaA Ser fje Arg Asp Leu Gly Trp Gin 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phfl v , 

20 y he ^ r c y s A sp Gly 

° 30 

cys ser P „e Pro Leu As„ *U «. „ et 4s „ Ala ^ r ^ „ is ^ 

«• Val 0. fc Uu m u „ „ et phe ^ ^ 

JJ 60 7 

J» c ys cys A!a Pro Tj r Lys Uu 4s „ ^ ^ ^ ^ ^ ^ ^ 

75 80 
*P Asp Ser ser g Val Ile Leu Lys ^ ^ ^ ^ ^ ^ 



95 



Arg Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS- 

Jb! ™ : 102 ^ino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note- "BMP 6" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Cys Arg Lys His Glu Leu Tyr Val Spr Ph a n * , 
! 5 iyr val Ser Phe Gin Asp Leu Gly Trp Gin 

10 15 



Asp Trp He „ e 41a Pr0 Lys Gly ^ ^ ^ ^ 

° 30 
Glu Cys Ser P he Pro u„ 4s „ jj. „. Met As „ AU ^ ^ 



r 
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He Val Gin Thr Leu Val His Leu Het Asn Pro Glu Tyr Val Pro Lys 



60 



Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 

70 75 80 

Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Trp Met Val Val 



85 

Arg Ala Cys Gly Cys His 
100 



90 



95 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label= OPX 

/note= "WHEREIN XAA AT EACH POS'N IS INDEPENDENTLY 
SELECTED FROM THE RESIDUES OCCURRING AT THE 
SHS^f nJ°™ 2 ™ C - T ^INAL- SEQUENCE OF MOUSE 
?" and" ?2.r 2 (SEE SEQ - 10 N ° S - 5 ' 6 ' 7 and 8 or 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Cys Xaa Xaa His Glu Leu Tyr Val Xaa Phe Xaa Asp Leu Gly Trp Xaa 

10 15 

Asp Trp Xaa lie Ala Pro Xaa Gly Tyr Xaa Ala Tyr Tyr Cys Glu Gly 
U 25 30 

Glu Cys Xaa Phe Pro Leu Xaa Ser Xaa Met Asn Ala Thr Asn His Ala 

40 45 

He Xaa Gin Xaa Leu Val His Xaa Xaa Xaa Pro Xaa Xaa Val Pro Lys 
JU 55 go 
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J" cys cs 41a Pt0 ^ Xaa Uu Xaa AU v ^ ^ ^ jM 

80 

*P **a Se, x aa j. Val Xaa Uu x , a ^ ^ ^ ^ ^ 

* u 95 



Xaa Ala Cys Gly Cys His 
100 



(2) INFORMATION FOR SEQ ID N0:30: 

(i)SEQUENCE CHARACTERISTICS- 

(Bj mE™ : 97 ^ acids 
" IF E: aramo acids 

(C) TOPOLOGY: linear 
ii MOLECULE TYPE: pr^teL 

(ix) FEATURE: V^ozein 

( ,nl OH**'' Generic Sequence 5 

(D) OTHER INFORMATION: herein Pa .h v • 

selected from a group of ohpoJ 33 15 inde Pendently 
defined in the speSiJatxon. "° re s P ecifie d amino acids 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 

Leu Xaa Xaa Xaa Phe 

1 5 
Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

^a Tyr Cys Xaa Gly Xaa Cys Xaa 

-* 30 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 

Xaa Xaa Xaa Xaa Xaa Xaa x'aa Xaa 

v 50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

Cys Xaa^Pro Xaa Xaa Xaa xll Xaa 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

90 

Xaa Cys Xaa Cys Xaa 
95 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 6 

(D) OTHER INFORMATION: wherein each Xaa is independently 
selected from a group of one or more specified amino acids as 
defined in the specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Cys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Phe 
1 5 10 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 
15 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 
40 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 
75 80 
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Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

90 95 
Xaa Cys Xaa Cys Xaa 
100 



INFORMATION FOR SEQ ID NO: 32: 

SEQUENCE CHARACTERISTICS: 
LENGTH: 1238 base pairs, 372 amino acids 
TYPE: nucleic acid, amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) ORIGINAL SOURCE: 
(A) ORGANISM: human 
(F) TISSUE TYPE: BRAIN 



(i) 
(A) 
(B) 
(C) 
(D) 



(iv) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 

(D) OTHER INFORMATION: 

/product= "GDF-1" 
/note= "GDF-1 CDNA" 



(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Lee, Se-Jin . 

(B) TITTLE: Expression of Growth/Differentiation Factor 1 

(C) JOURNAL: Proc. Nat'l Acad. Sci. 

(D) VOLUME: 88 

(E) RELEVANT RESIDUES: 1-1238 

(F) PAGES: 4250-4254 

(G) DATE: May-1991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GGGGACACCG GCCCCGCCCT CAGCCCACTG GTCCCGGGCC GCCGCGGACC CTGCGCACTC 

TCTGGTCATC GCCTGGGAGG AAG ATG CCA CCG CCG CAG CAA GGT CCC TGC GGC 

Met Pro Pro Pro Gin Gin Gly Pro Cys Gly 
1 5 10 

CAC CAC CTC CTC CTC CTC CTG GCC CTG CTG CTG CCC TCG CTG CCC 
His His Leu Leu Leu Leu Leu Ala Leu Leu Leu Pro Ser Leu Pro 
15 20 25 
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2 2 2 S 2 2 £ S g 2 S S S 2 2 

35 40 

2 s 2 35 2 2 5 a 2 2 g S S 25 2 

50 55 

5SS2222522ES222 

65 70 

S 2 2 2 ^ E g » « « « E « - « 

80 85 

2 2 2 2 2 2 2 2 cf ? G S 6 GTC « <*» « 

5q H1S Val Glu Glu Le " Gly Val Ala Gly Asn 

95 100 

SSS22222g222g22 
2 S 2 2 2 2 g 2 2 2 2 g 2 2 2 

125 130 

S 2 2 2 2 2 2 2 2 2 2 2 2 2 

140 145 

5 2 gS 2 2 2 2 2 2 2 2 2 2 2 2 

155 160 

5 35 2 2 2 2 2 2 2 2 2 2 g 2 f 
2 2S S 2 2 2 2 2 2 2 2 2 2 2 2 

185 19Q 

S? 2 2 2 2 2 2 2 2 2 2 2 g 2 2 

200 205 

% 2 2 g 2 2 2 2 2 2 2 2 2 2 2 

215 220 



203 



248 



293 



338 



383 



428 



473 



518 



563 



608 



653 



698 



743 



- 145 - 



CCC CGG GCC CCT GCC GCC TGC GCG CGC CTG GCC GAG GCC TCG CTG 
Pro Arg Ala Pro Ala Ala Cys Ala Arg Leu Ala Glu Ala Ser Leu 



788 



225 



CTG CTG GTG ACC CTC GAC CCG CGC CTG TGC CAC CCC CTG GCC CGG 833 
Leu Leu Val Thr Leu Asp Pro Arg Leu Cys His Pro Leu Ala Arg 
240 245 /:>u 

CCG CGG CGC GAC GCC GAA CCC GTG TTG GGC GGC GGC CCC GGG GGC 878 
Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 

255 260 zt}!) 

GCT TGT CGC GCG CGG CGG CTG TAC GTG AGC TTC CGC CAG GTG GGC 923 
Ala Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly 
270 275 280 

TGG CAC CGC TGG GTC ATC GCG CCG CGC CCC TTC CTG GCC AAC TAC 968 
Trp His Arg Trp Val lie Arg Pro Arg Gly Phe Leu Ala Asn Tyr 
H 285 290 295 

TGC CAG GGT CAG TGC GCG CTG CCC GTC GCG CTG TCG GGG TCC GGG 1013 
Cys Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 
300 305 

GGG CCG CCG GCG CTC AAC CAC GCT GTG CTG CGC GCG CTC ATG CAC 1058 
Gly Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His 
' 315 320 325 

GCG GCC GCC CCG GGA GCC GCC GAC CTG CCC TGC TGC GTG CCC GCG 1103 
Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 

335 iW 



330 



CGC CTG TCG CCC ATC TCC GTG CTC TTC TTT GAC AAC AGC GAC AAC 1148 
Arg Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 
° - 350 3- 0 



345 



GTG GTG CTG CGG CAG TAT GAG GAC ATG GTG GTG GAC GAG TGC GGC 1193 
Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 
360 365 J /u 

TGC CGC TAACCCGGGG CGGGCAGGGA CCCGGGCCCA ACAATAAATG CCGCGTGG 1238 



Cys Arg 
372 



(34) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human 
(F) TISSUE TYPE: BRAIN 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 

(D) OTHER INFORMATION: /function= 
/product= "GDF-l" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Met Pro Pro Pro Gin Gin Gly Pro Cys Gly 
1 5 10 

His His Leu Leu Leu Leu Leu Ala Leu Leu Leu Pro Ser Leu Pro 
15 20 25 

Leu Thr Arg Ala Pro Val Pro Pro Gly Pro Ala Ala Ala Leu Leu 
30 35 40 

Gin Ala Leu Gly Leu Arg Asp Glu Pro Gin Gly Ala Pro Arg Leu 
45 50 55 

Arg Pro Val Pro Pro Val Met Trp Arg Leu Phe Arg Arg Arg Asp 
60 65 70 

Pro Gin Glu Thr Arg Ser Gly Ser Arg Arg Thr Ser Pro Gly Val 
75 80 85 

Thr Leu Gin Pro Cyc His Val Glu Glu Leu Gly Val Ala Gly Asn 
90 95 100 

lie Val Arg His lie Pro Asp Arg Gly Ala Pro Thr Arg Ala Ser 
105 110 H5 

Glu Pro Val Ser Ala Ala Gly His Cys Pro Glu Trp Thr Val Val 
120 125 130 

Phe Asp Leu Ser Ala Val Glu Pro Ala Glu Arg Pro Ser Arg Ala 
135 140 145 

Are Leu Glu Leu Arg Phe Ala Ala Ala Ala Ala Ala Ala Pro Glu 
* 150 155 160 
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Gly Gly Trp Glu Leu Ser Val Ala Gin Ala Gly Gin Gly Ala Gly 
' 165 170 

Ala Asp Pro Gly Pro Val Leu Leu Arg Gin Leu Val Pro Ala Leu 

180 lttD 

Gly Pro Pro Val Arg Ala Glu Leu Leu Gly Ala Ala Trp Ala Arg 
Asn Ala Ser Trp Pro Arg Ser Leu Arg Leu Ala Leu Ala Leu Arg 



210 



215 



Pro Arg Ala Pro Ala Ala Cys Ala Arg Leu Ala Glu Ala Ser Leu 
225 230 

Leu Leu Val Thr Leu Asp Pro Arg Leu Cys His Pro Leu Ala Arg 



240 



Pro 



Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 

255 260 

Ala Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly 

Trp His Arg Trp Val lie Arg Pro Arg Gly Phe Leu Ala Asn Tyr 
285 290 

Cys Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 
300 305 

Gly Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Het His 
315 320 

Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 

330 335 J 

Arg Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 



345 



Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 



360 



365 



Cys Arg 
372 



